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ABSTRACT 



This document describes the work of SDC's Research 



& Technology Division for 1966. The progress of the 
various studies and activities in the Division is 
described under the following major headings: Advanced 

Programning, Information Processing Research, Programming 
Systems, Data Base Systems, Language Processing & Retrieval, 
Behavioral Gaming and Simulation Research, Education & 
Training, Mathematics & Operations Research, Computer 
Center Department, and Special Service Operations. In 



addi*" jji, the back of the Report contains descriptions 
of Division 'sponsored books, demonstrable programs, 
meetings and col loquia , and professional activities of 
the staff. 
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INTRODUCTION AND OVERVIEW 



Mission 

The mission of SDC's Research & Technology 
Division is to provide support for .the rest of 
the Corporation and for its customers. The pro- 
gram consists of extensive research and technology 
development, particularly in the information 
sciences, and the provision of a new computer 
center for the Corporation. The research and 
technology tasks are intended to develop the 
tools which SDC's customers will use in carrying 
out their functions and which the professional SDC 
employees will use in the fulfillment of their 
contractual commitments to governmental and other 
clients. The computer center is being developed 
in the Research & Technology Division so that the 
latest technological developments will be avail- 
able to all SDC professionals at an early stage. 

Organization 

The program is carried out in three organiza- 
tional units — the Research Directorate, the 
Technology Directorate, and the Computer Center 
Department (see Figure 1). Because of the close 
relationships among these units, the lines of 
demarcation are not clear-cut among them. The 
Research Directorate is generally responsible for 
programs with long-range implicat ons, for which 
there may not be an immediate match to existing 
requirements. The Technology Directorate's work 
spans the interval between these long-range pro- 
grams and appliable technology; it is therefore 
concerned with applied research, pretechnology, 
and technology development, as well as a certain 
amount of fundamental research. For much of the 
Technology Directorate's work, the end objective 
is pretty well known at the time the project is 



undertaken and its relationship to SDC and our 
customers' operations is often fairly clear-cut 
(although the way of achieving those desired ends 
may not be quite so well determined) . The Computer 
Center Department has been developing the software 
for a new computing complex, intended for a wide 
range of corporate users. 

Technical Overview 

The research and technology programs are com- 
posed of several main threads of activities which 
are mutually supporting. Typically, each thread 
includes projects ranging from fairly basic 
research through the development of technology. 

The first of these threads relates to the 
handling of large files of structured information — 
the data base problem. In this area, we are 
concerned with developing a series of tools that 
SDC's professionals and customers can apply to the 
operational problems of data management. He want 
to make it possible for a nonprogrammer to describe, 
organize, and update his data, and then have the 
programming system carry out the mechanical func- 
tions of processing the data. In addition, we 
wish to make it simple for the user to ask for 
data in a manner that is convenient and natural to 
his way of operation and to have the system under- 
stand the details of data storage, organization, 
and conversion. Our interests in this area 
include the content as well as the format of the 
computer output. 

The second main thread of our efforts concerns 
computer programming languages. Most past work 
in this area has consisted of developing languages 

for the professional computer programmer. These 
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languages were aimed at relieving him of some of 
the tedium and bookkeeping necessary to produce 
computer programs. A variety of such languages 
has been produced, including the JOVIAL language 
developed by SDC and used extensively within the 
Corporation and externally. We have augmented 
this traditional approach to programming languages 
with initial efforts at providing languages for 
the nonprogrammer. The overall work spans the 
spectrum from basic theory of programming 
languages through the implementation of compilers 
for several languages of present interest. Encom- 
passed in this domain are the development of new 
aids for the programmer and of techniques for 
managing the computer program deve’ opment process. 

The third thread of our activities concerns the 
executive Systems within which the programs of a 
computing complex operate. In particular, an 
innovative technology of time-sharing, which makes 
it feasible for users to operate on-line in direct 
two-way conversation with the computer, has been 
developed through this work. Our efforts are 
aimed at exploiting the capability inherent in 
time-sharing systems and in understanding the 
basic scheduling and allocation processes 
associated with having a number of programs 
operating more or less simultaneously in the 
computer complex. In addition, much of the 
overall emphasis of our programs in technology 
and of the techniques that we use in research 
revolves around our unusual on-line capabilities; 
we are concerned with the advantages made feasible 
by having the computer directly accessible to the 
users and with exploiting these advantages for the 
various purposes toward which the whole program is 
aimed . 

The fourth thread is concerned with computer 
processing of natural language, that is, English 
as it is spoken and written— as distinct from the 
formatted languages of data base systems or the 
formal computer languages. A sizeable program of 
basic research is being devoted to solving the 
problems of semantic and syntactic analysis of 



text by computer, toward the eventual goal of 
providing capabilities for using subsets of natural 
English as query or command languages for computers, 
and enabling computers to read, understand, and 
generate English text. At the more applied end 
of the spectrum, we are concerned with developing 
tools for automatic classification and indexing 
°f documents and otherwise automating the 
storage and retrieval functions of libraries 
and document centers as well aj of individual 
document holders. 

The fifth thread of our activities relates to 
the processes of education. In this area we are 
concerned with new technologies for instruction, 
particularly the use of programmed materials; 
with the potential offered oy the computer for 
assisting teachers, counselors, and administrators; 
and with the implications of these new technologies 
for school administration, including problems of 
flexible scheduling of resources and personnel. 

This program is continuing to broaden in scope and 
to place increasing emphasis on bringing new 
technologies into use in the public school systems. 

Finally, we have programs in mathematics and 
operations research and in exploring the processes 
of human decision making, and of man-machine 
interaction, through behavioral gaming. Much of 
the mathematical work is of a research nature and 
relates in a general way to corporate interests in 
simulation, modeling, and system analysis. The 
work in behavioral gaming and simulation is leading 
to a better understanding of the nature of group 
decision making and the functioning of organizations , 
and is also resulting in a set of innovative tools 
for the conduct of behavioral games and computer- 
based data analysis. In the long run, we hope to 
see this work lead to improved comprehension of 
the interaction of people within their society and 
to the knowledge that makes it possible to improve 
that society. 

It should be pointed out that these basic threads 
intertwine as we begin to derive appliable tech- 
nologies from than. The operating systems provide 
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the framework within which all of the computer 
programs operate. The data base systems are 
integral to the operating system and will, in 
our plans, provide a data management function for 
all system users. The operating and data base 
systems are written in the programming languages 
that we have been developing; and our language 
processors or compilers are embedded in the 
systems. Although there was a considerable gap 
between the natural language efforts and those 
related to the more formal languages, we are 
beginning to develop systems that combine mixed- 
mode retrieval capabilities of both structured 
and unstructured data. The mathematics and 
operations research personnel provide consulting 
assistance for many of the other research ana 
technology projects. The work in education has 
been focusing on the potential of on-line man- 



Center Department has been developing the complete 
software. A 360 Model 50 was operational from 
October 1965 through July 1966; it is being 
replaced by a time-shared Model 65 which began to 
achieve a useful capability during the latter 
part of the year. 



The changeover in computers is resulting in mixed 



consequences. On the one hand, certain experiments 
have been delayed due to reprogramming, a hiatus 



% 



in machine availability, and the uncertainties 
that inevitably accompany the installation of new 
equipment. On the other hand, many of the projects 
are taking this opportunity to make long— sought 



program changes and design improvements afforded 
by this breathing space and by the more powerful 
capabilities of the new installation. 

Turning to specific projects, the data base area 
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machine interaction, made possible by time-sharing 
technology. The behavioral gaming area is pro- 
ducing novel techniques for human use of computer- 
based consoles and displays. Finally, the Computer 
Center Department will be providing — on an 
operational basis for a variety of users-— the 
data base, time-sharing, and advanced compiler 
concepts developed by the research and technology 
program. 

Progress 

The past year was one of actual or impending 
transition for many of the 70-some R&TD projects 
that utilize a computer, as we moved toward 
replacing existing facilities with a third- 
generation computing installation. Specifically, 
the Phil co 2000, which since 1961 had supported 
many SDC's Independent Research projects and the 
original Systems Simulation Research Laboratory, 
was sold in September 1966. The IBM Q-32 computer, 
which has been supporting the balance of the 
computer-based projects in R&TD on the SDC- 
pioneered Time-Sh *ng System, is scheduled to 
phase out late in >7. Replacing these computers, 
as well as some others at SDC, is a series of 
IBM S/360 machines, for which R&TD's Computer 



was marked by increased usage of our prototype 
data management tools. TSS-LUCID, an on-line data 
management system, was used by about 50 people 
each month during 1966, including many external 
users, for a wide range of data management prob- 
lems. The General-Purpose Display System (GPDS) 
was successfully harnessed to a particular 
application, namely the calculation and display 
of salary maturity curves. Various parts of the 
new Time-Shared Data Management System— an 
integrated set of data handling tools based on 
experience with LUCID, GPDS, and other systems— 
reached promising stages of design, coding, and 
checkout . TDMS is scheduled to be a major 
resource on SDC's 360 Time-Sharing System. 

In the realm of programming languages, the 
techniques of metacompilation received increased 
emphasis as a means of reducing the cost and 
time to produce compilers. A compixer system, 
called META, and an interpretive system, called 
META5, have been developed and refined; they have 
found many useful applications including the 
production of compilers, translation between 
programming languages, data base conversions,, and 
program reformatting. A version of an advanced 
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list processing language, LISP 1.5, was developed 
during the year, and work was Initiated on LISP 2, 
which will provide improved capabilities for 
manipulating complex data structures and perform- 
ing lengthy arithmetic computations. Design 
and coding were well under way for an Interactive 
Programming Support System, intended to give 
the professional programmer the maximum benefit 
of the on-line capability of time-sharing. 

Programs were developed to facilitate automatic 
recognition of handwritten characters, automatic 
flow charting, and automatic code improvement. 
Finally, a handbook to aid managers in preparing 
estimates of computer program development pro- 
jects was published. 



The executive system area underwent profound 
changes, with final improvements made to the 
Time-Sharing System (TSS) on the Q-32, and the 
development of a new system on the 360. One of 
the first general-purpose time-sharing systems, 
TSS became operational in June 1963 and has been 
continually improved and refined over the years, 
to the point where its sponsor, the Advanced 
Research Projects Agency, felt that it had well 
fulfilled its purpose as a developmental research 
project. During 1966, the system was made 
available to interested users on a subscription 
basis; ARPA support is no longer required and 
has been redirected to more developmental efforts. 
The more than 500 users who previously had free 
access to the system are now limited to those 
who use it on a paid basis. During the year, a 
comparison of on-line versus off-line programmer 
performance resulted in qualified advantages for 
time-sharing. Additionally, a small step was 
taken toward the next important area in executive 
systems — the connecting of several computers into 
a network— by the linking of the Q-32 computer in 
Santa Monica to the TX-2 computer at Lincoln 
Laboratory in Boston. 



In the natural language area, several new 
studies were initiated to complement the existing 
work on automated language processing. The new 



studies, sponsored by the Advanced Research Pro- 
jects Agency, include a computer-based semantic 
analysis of sense relationships of the words in 
a dictionary, the development of an on-line 
transformational grammar tester, and a study of the 
use of English subsets in query systems. Addi- 
tionally, Protosynthex III, a fairly complete 
approach to a natural language processor, emerged 
from the preliminary design stage. In the library 
application area, BOLD, a highly automated display- 
oriented document storage and retrieval system, 
and SURF, a personal file retrieval system, were 
further refined and received initial usage. A 
new method of automatic document classification, 
called ALCAPP, broke through previously restric- 
tive barriers of cost and storage space. A paper 
on ALCAPP, as well as one on an SDC study to 
evaluate document representations, were two of the 
three prize-winners at the annual meeting of the 
American Documentation Institute. 



The education and training area completed several 
studies in 1966, including the development of a 
computer-based simulation of an innovative school; 
the development of 28 criterion tests that indicate 
absolute levels of mastery in foreign- language 
comprehension, speaking, reading, and writing; and 
a comparison of linear vs. branching strategies 
in presenting programmed material. Two major new 
studies were begun during the year; the develop- 
ment of a computer-based educational system for 
the Southwest Regional Laboratory for Educational 
Research and Development e and a study to adapt the 
SDC -designed "empirical trial-and- revision" 
process to the development of instructional 
materials and procedures for classrooms serving 
predominantly Spanish-American students. Consid- 
erable progress was made in ongoing investigations 
into the use of time-sharing in education: in one 

case to improve the counseling function; in another 
to improve the teaching of statistical inference 
at the college level. A very promising outcome 
of the last-named study is an on-line lesson design 
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and teaching program called PLANT!', which is 



receiving wide interest and initial usage. 



The mathematics and operations research staff 
continued to solve a number of challenging prob- 
lems. With partial funding from the Bureau of 
Public Roads, SDC's Vehicular Traffic Study 
completed the coding, debugging, and exercising 
of an initial version of a computer simulation 
model of a freeway diamond interchange. A new 
project was begun on ways to compress the enor- 
mous amounts of data transmitted from spaceborne 
hardware. In celestial mechanics, new procedures 
for existence proofs were derived and applied, 
and a very efficient numerical technique was 
devised for solving certain differential equations. 
Additional work yielded new results in such areas 
as optimal strategies for item presentation in 
education, stochastic duels, life-testing, 
validation of simulation models, factor analysis, 
and mathematical programming. The algorithmic 
languages project continued to contribute 
important insights into formal languages through 
the noteworthy publication of 12 papers in 
major journals during 1966. 



In the area of man-machine interaction, work 
continued along a broad front. A robot-like 
system, capable of following simple commands, 
was programmed and several films were made of 
its operation on a display scope. The augmenta- 
tion of human intellect by machine was furthered 
by the Initiation of an ’’augmented statistician" 
project, the development of an interactive prob- 
lem-solving task called Shlmoku, the completion 
of a set of experiments to test the effectiveness 
of various display aids to human problem solving, 
and the development of various on-line data 
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manipulation systems. The continuing research 
on bargaining and negotiation behavior completed 
several major experiments during the year, 
Including a transnational study which culminated 
in a conference of participating researchers 
from various nations in Santa Monica in November. 
The Leviathan system, a computer-based model of 
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large social organisations, was used experimentally 
as a training tool in a management workshop con- 
ducted at the University of Southern California 
for Air Force officers. 



As mentioned before, the Computer Center 
Department completed design and programming for 
a number of operating systems on a series of 
IBM 360s: a batch-processing Model 50, which was 

operational from October 1965 through July 1966; 
an interim batch-processing system on the Model 65, 
operational since July; and a time-sharing system 
on the Model 65, for which an initial version was 
released in November. In addition, the Department 
has been developing other supporting aids, includ- 
ing a 360 JOVIAL compiler, on-line and off-line 
debugging tools, program and text editing capabil- 
ities, file maintenance programs, and assembly 
language processors. The laboratory and planning 
staffs have been active in the selection and 
acquisition of a complex of displays and other 
ancillary equipment to be used in connection with 
the 360. 



Organizational Changes and Appointments 

In recognition of his technical and administra- 
tive accomplishments as assistant to the R&TD 
Manager, Bill Barancik was appointed Assistant 
Division Manager in January 1967. 

During 1966, several organizational changes took 
place in the management and composition of the 
various areas of the Research and Technology 
Directorates. In March, Gerald Shure was named 
head of the Decision Processes Research staff. 

In August, the Research Directorate underwent a 
restructuring: the concept of "staff" areas was 

changed to one of "program" areas; Decision 
Processes Research was replaced by Behavioral 
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Gaming & Simulation (with Shure as head); and a 
new program, Augmentation of Man's Intellect, was 
formed with Research Director Kenneth Yarnold as 
acting head. In December, a logical bifurcation 
of the Language Processing and Retrieval staff 
resulted it. a research-oriented Language Processing 
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Research Program, under Robert Simmons, in the 
Research Directorate, and an applies* iona-or iented 
Information Systems Technology staff, under 
Carlos Cuadra, in the Technology Directorate. 

In the Computer Center Department, A! Irvine, 
formerly head of the UCLA Computer Network pro- 
ject's programming staff, joined SDC to head the 
CCD's Programming Branch, and Jerry Hanna was 
appointed Assistant to the CCD Manager, respon- 
sible for the nonprogramming aspects of the 
Department's activities. 

Other major appointments during the year 
included: Andy Gafarian (Mathematics & Operations 

Research Program) and Gerald Shura to Senior 
Scientists; and Robert Bleier and Sally Bowman 
(both of the Data Base Systems staff) to Research 
Leaders. 

Interaction with Other Divisions 



in September 1966. The committees, chaired by 
senior corporate managers and composed of high- 
level technical representatives from the various 
SDC organisations, are intended to provide a 
mechanism for the communication of technical 
information among SDC 's divisions and to help 
solve common problems in the areas of data base 
systems, education and training technology, 
executive systems, natural language processing, 
and programming languages. In addition to active 
participation on the Committees, R&TD has supported 
them through initial conception and planning, and 
provision of recording personnel. During the last 
four months of 1966 the five Committees met 
a total of Jl times, and can point to a number of 
accomplishments, such as the undertaking of new 
interdivisionai projects, improved utilization of 
common resources, and measurably enhanced communi- 
cations. 











During 1966, the Research & Technology 
Division increased its efforts to communicate 
with and support i ;r corporate organizations. 

A survey undertaken in June of 1966 indicated 
that, during that month alone, approximately 60 
R&TD personnel, or about 30 percent of the profes- 
sional staff, were actively engaged in 23 clearly 
identifiable, special-purpose, ad hoc activities 
in support of other SDC organizations. These 
activities ranged from briefings on R&TD projects 
to long-term consultation on projects of other 
divisions. It should be stressed that this survey 
did not cover the normally ongoing corporate-wide 



Another innovation in 1966 was the R&TD internship 
program. Under this program, individuals selected 
by the other divisions join R&TD for periods of up 
to six months, to participate actively in research 
and technology projects. The objective is to pro- 
vide the participants with a working knowledge 
of R&TD ' s technology, which they will ultimately 
carry back to their own organizations. Eight 
interns joined R&TD in the last half of 1966. 

Judging from initial indications, the program is 
accomplishing its aims and will be continued in 
1967. 






responsibilities of R&TD, which include the 
development of the Computer Center, maintenance 
of information centers on information processing 
and information science, furnishing of statistical 
services, and, more generally, the overall 
development of the products of research and 
technology, which are intended for the use of 
the other SDC organizations and their customers. 



' ’ * <~uay 

sessions of special briefings and demonstrations 
expressly for personnel in the other divisions. 
Approximately 150 middle management and senior 
technical personnel from throughout SDC received 
detailed descriptions of the ten most frequently 
demonstrated R&TD products. 



An important advance in corporate communication 
was the formation of five Interdivisionai Tech- 
nical Steering Committees by SDC President Melahn 



Communications 

As in the past, R&TD continued an active program 
of communication, both internally and externally. 
Division personnel were coordinators and hosts for 
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a number of local, national and international 
meetings. Also, 20 research colloquia, featuring 
both SDC and external speakers, were held during 
the year; these are open to all SDC personnel 
and 1 invited outsiders. 

0ne of the most effective ways in which we 
have communicated to the outside world is through 
live demonstrations of the programs developed 
in R&TD. During 1966, several thousand visitors 
to SDC received first-hand exposure to man-machine 
interaction under time-sharing, as the computer 
displayed to them, on scopes or teletypes, the 
results of the actions taken by the demonstrators 
or, in many cases, by the visitors themselves. 
Additionally we gave or supported a large number 
of remote demonstrations, linked via teletype to 
the Q-32 computer in Santa Monica, at symposia 
throughout the country and abroad. During the 
year considerable effort was devoted to Improving 
our demonstrations, from the standpoints of both 
presentation and technical support. A number of 
the more frequently demonstrated programs are 
briefly described in the Appendix. 

During 1966, approximately 40 consultants and 
selected graduate students participated in the 
Division's program, over periods ranging from 
several days to several months. Apart from 
the technical contributions made by these people, 
the exchange of ideas gained by these close 
working relationships has been of great value to 
SDC and to the visiting personnel. 

Another effective medium of communication is 
the "lend-lease" program, instituted in 1966. 
Similar to the internship program described above, 
the lend-lease program enables technical special- 
ists from outside agencies to work on SDC projects s 
providing a valuable exchange of ideas between 
these people and R&TD researchers. During 1966, 
personnel from Shell Oil, Atlantic-Richf ield, 

IBM, and Bolt, Beranek and Newman joined R&TD to 
participate in the development of projects in 
data management and programming languages. 



As is traditional in science and technology, 

SDC makes a great effort* to communicate its 
findings to the external community. During 1966, 
Division personnel published 60 articles in the 
external literature; this was augmented by over 
300 SDC documents, most of which are available 
through the Defense Documentation Center. 

Finally, R&TD personnel gave a total of 175 oral 
presentations for professional meetings, university 
colloquia, and invited lectures. In addition, 
many members of the Division are officers in their 
professional societies and editors for journals 
in their field. 

Detailed information on all these activities 
can be found at the back of this report. 

Postdoctoral Fellowship Program 

In the fall of 1966, R&TD instituted a Post- 
doctoral Resident Research Fellowship Program. 
Fellows selected under this program will receive 
a $9,000 stipend while conducting research of 
their own choosing in the Research & Technology 
Division in Santa Monica. Major resources available 
to Fellows are the knowledge and experience of a 
multidisciplinary staff of senior investigators, 
and the facilities of a computer-based man-machine 
laboratory. 

A representative, but by no means exhaustive, 
list of areas that may be proposed for research 
includes man-machine Interaction, operations 
research, mathematical modeling, digital simula- 
tion, education and training, experimental gaming, 
decision making, computational linguistics, 
information management, computer graphics, 
automata theory, formal and programming languages, 
programming systems, and the application of informa- 
tion processing to law, medicine, economics, and 
other fields. 

The fellowship program was developed partly as 
a result of the successful experiences of NSF and 
NIH fellows who have spent their research periods 
at SDC. 
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Research Advisory Committee 
The Research Advisory Committee continued to 
play an important role in the Corporation. 

During 1966, the Committee met for four 2-day 
sessions at Santa Monica (including a joint 
meeting with the Board of Trustees) to consider 
the work of R&ID and co offer broad guidance on 
the overall program. During the year the RAC 
focused on the following areas: R&TD's operating 

plan,, the "augmentation of man's intellect" pro- 
gram, education and training, and executive 

systems. 

* 

In July, President Melahn announced the 
rotation of the chairmanship of the RAC to 
Dr. Merrill Flood, succeeding Dr. C. West 
Churchman who had chaired the RAC since its 
inception in 1962. Dr. Churchman continues to 
serve as a valued member of the Committee. The 
complete list of RAC members is as follows: 

Dr. Merrill M. Flood (Chairman) 

Professor and Senior Research Mathematician 
University of Michigan 

Dr. William C. Biel (Secretary) 

Associate Dean of the Graduate School 
University of Southern California 

Dr. C. West Churchman 

Professor of Business Administration 

University of California, Berkeley 

Dr. Harry D. Huskey 

Professor of Mathematics and Electrical 
Engineering 

University of California, Berkeley 

Dr. John L. Kennedy 
Department of Psychology 
Princeton University 



Dr. Anthony G. Oettinger 

Professor of Mathematics and Linguistics 

Harvard University 

General Earle E. Partridge 
USAF, Retired 

Organization of Report 

In the main, the report that follows has been 
organized to reflect the several threads of attack 
indicated in the technical overview. In cases 
where a project belongs administratively under 
one area, but fits functionally more clearly in 
another, the functional relationships have 
governed. Thus, the administrative structure of 
R&1D, and in particular the organizational changes 
that occurred late in the year, are not necessarily 
reflected in the project descriptions that follow. 

If it becomes increasingly difficult for us 
to pigeonhole a given project— for example, to 
judge whether the PLANIT language for on-line 
lesson design belongs under education or man- 
machine interaction or programming languages or 
some other area— we take these multiple attach- 
ments to be a healthy sign. Our aim is to 
continue to break down the traditional barriers 
imposed by different disciplines and skill fields, 
with the goal of producing tools that draw upon 
many areas of knowledge to fulfill a wide range 
of uses. 



Donald L. Drukey 

Vice President and Manager 

Research & Technology Division 
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ADVANCED PROGRAMMING 



E. H. Jacobs, Head 



The Research & Technology Division's extensive 
work in progranmii ,g technology is being conducted 
by several staffs. For purposes of this report, 
these efforts have been integrated in a single 
section titled "Advanced Programming." Con- 
stituting the major part of this section is the 
work of the Programming Technology staff, headed 
by E. H. Jacobs. Also included are appropriate 
parts of the work of the Programming Systems 
staff (see also page 3-1), the Information 
Processing Research staff (see also page 2-1), 
and the LISP development activity. 

The area oi Advanced Programming embraces study 
and development of tools and techniques for the 
computer programmer, the nonprogranming user of 
computers, and the manager of computing installa- 
tions. The major areas being explored are 
compilers and programming languages, aids to 
programmers, and studies of the programming 
process. 



The first of these, compilers and programming 
languages, has been a major activity at SDC for 
some years and the JOVIAL language end compilers 
have emerged as usable tools. Continuing research 
aims at finding new techniques of compiler pro- 
duction and at increasing the language capability 
available to programmers. 



In the realm of improved compilers, the 
techniques of metacompilation are being studied, 
as a possible means of reducing the time and cost 
required to produce compilers. These techniques 
have shown the capability to produce certain parts 
of compilers,, notably the so-called "front-end" 



which translates from source language to an inter- 
mediate language and, in some cases, to machine 
language. Both a metacompiler and a metalanguage 
interpreter have been constructed and are being 
vised to explore the problems of "describing" a 
compiler. The metalanguage interpreter has also 
been used for other applications, including the 
translation of one POL into another. 



In programming languages, one current project 
is aimed at extending LISP, an advanced list 
processing language. List processing languages 
have been found useful in work involving extensive 
manipulation of symbols (as opposed to arithmetic 
computation), but their utility has been blunted 
by the alow speed of their processors and by tfteir 
limited capability t:o handle problems involving 
both symbolic manipulation and arithmetic computa- 
tion. The LISP project is producing an advanced 
processor, with a built-in computational facility. 
One version of such a processor, called LISP 1.5, 
was built during the year and used to test ideas. 
Another version, LISP 2, is being built to imple- 
ment these ideas in a complete system. 



Exploration in another dimension, furnishing 
new aids to programming, is being vigorously 
pursued. The work includes finding aids that 
help the programmer to make better use of the 
higher level languages, making it easier to get 
programs written and assisting in code checking. 

A major project in this area is the design and 
development of an interactive programming support 
system. Such a system will he designed to give 
the programmer the maximum benefit of the on-line 
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capability available in a time- sharing mode of 
operation. Compilers and checkout tools specifi- 
cally designed for a time-sharing environment 
will have many more points of programmer inter- 
action than are normally found. Further, the 
language or control structure will be shaped to 
allow the programmer to switch from function to 
function (for examp^ , from a compile mode to a 
debug mode) without having to make a corresponding 
switch in language. 

Several other projects are expected to contri- 
bute to the interactive system. The work on 
compiler construction described earlier will 
provide the basis for the compiler used in the 
system. Other contributing projects are Automated 
Flow Analysis and Graphic Input/Output. The first 
of these is developing programs to analyze other 
programs in order to get a machine-produced flow 
chart. During the year, a program was written 
that produces a series of flow charts from a 
program written in JOVIAL. The first chart is 
very detailed and successive ones present less 
detailed information, giving a better overall 
picture. A byproduct of this work was the 
development of a program to automatically improve 
code written in JOVIAL. 

The Graphic Input/Output project is developing 
techniques by which a programmer may write his 
code directly Into the computer (as opposed to 



use of a teletypewriter or keypunch). Initial 
work is on character recognition routines that 
can operate rapidly enough to be useful in the 
on-line environment. At the same time, routines 
that display lines of code and make deletions and 
insertions are being developed. Several experi- 
mental recognition routines as well as parts of 
the needed control programs have been written and 
checked out on the Phileo 2000. 

Study of the program development process has 
also been a significant activity. This research 
is aimed at systematizing and improving control 
and planning techniques for use by managers of 
computer program development. The work includes 
analysis of the process of program development to 
identify relationships among programming products, 
resources, and environment. The goals are to 
identify and develop economical and efficient 
management methods for realizing programming 
products and to establish criteria for measuring 
the quality of these. 

The central effort in this area has been the 
statistical analysis of numerical data, character- 
izing completed computer programs, to derive 
improved methods for estimating costs. The work 
has resulted in publication of a handbook for use 
by managers in preparing estimates of a computer 
program development project. 
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COMPILER CONSTRUCTION TECHNIQUES 

Syntax-Directed Compilers and Translators* 

E. Book 

J. Igawa 

M. Schaefer 

D. V. Schorre 

Description 

The objective of this study is to develop 
techniques that will simplify the task of pro- 
ducing a compiler f r a new computer. SDC's 
experience has shown -hat the 9 to 12 months 
currently required to produce a compiler for 
command- control systems is a serious delay in the 
production of operational programs. The use of 
metacompilers has received much study by the 
computer community and is the focus of this 
project. This technique offers not only a pos- 
sibility for constructing compilers for new 
machines, but also a way to build compilers for 
new languages. 

The metacompiler makes use of a rigorous formal 
description of the language for which a compiler 
is to be produced. In addition, it requires a 
description of the translation (or compilation) 
process. (This description includes, either 
explicitly or implicitly, a description of the 
target machine language.) 

The work has taken two major forms, called META 
and META5. 

META is a compiler system which accepts as input 
a description of a desired compiler, in a special- 



in data manipulation, e.g. , relational and 
arithmetic operators; an assignment operator; 
search, concatenate, enter operators; etc. 

The META language consists of two sublanguages. 
The first of these, called SYNTAX, is used 1 to 
describe the syntax of a desired source language, 
specifying a mapping of a program in the source 
language into a tree- structure representation. 

The second sublanguage, called GENERATORS, spec- 
ifies the correspondence of the tree etruotcre 
to a desired target language. This is the 
semantics of the source language. It is the 
link between the string of marks input to the 
computer (the source language program) and the 
actions to be performed by the computer to 
achieve th*. desired result. This language is a 
cross between a pattern matching notation and a 
macro notation. It is a new language and will 
be developed further. The two sublanguages are 
nonprocedural or descriptive languages which are 
specially designed, for the specification of 
compilers and/or interpreters. 

The subroutines that do the work specified by 
the SYNTAX and GENERATOR languages are written 
in a language called MOL (Machine Oriented 
Language). These routines could have been 
written in any procedural programming language 
such a 8 JOVIAL, ALGOL, MAD, etc., or even in 
assembly language. However, the MOL language 
was designed and a compiler for it implemented 
for the reasons specified below. 




ired form, and which outputs the desired compiler 
in an executable form.. 

META5 is an interpretive system which consists 
of the META5 language, a META5 compiler, and a 
pseudomachine which is implemented on the Q-32 
computer. The MBTA5 language allows a variety of. 
data structures to be declared and used in the 
language. It also contains some operations not 
yet available in a system like META, but useful 



First, MOL is based on a compromise. On the 
one hand, it is desirable to take advantage of 
the computer hardware in producing machine code 
for various features of the language. On the 
other hand, it is undesirable to descend to the 
level of actual machine operations. MOL has an 
ALGOL-like *lavC>, but the operands and some 
operators are concerned with machine registers. 
Such things as indirect addressing, partial word 
fields, user control of index registers, and 
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similar considerations are specifiable in this 
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language. MOL combines an assembler's vocabulary 
with a compiler's grammar. 



Second, it was felt that more control could be 
exercised in adding features that would mesh this 
language into the META system if it were specifi- 
cally designed and implemented for that purpose. 

Third, since MOL is a procedural language, it 
provides an early test of the ability of the 
system to describe a conventional compiler. 



Progress 

The SYNTAX and GENERATOR languages were designed 
and a description of the syntax of both languages 
was written in SYNTAX. This also showed the 
mapping into the internal model (tree structure). 
The mapping of the internal model to Q-32 code 
was described for both SYNTAX and GENERATOR in 
the GENERATOR language. 



The procedural language M0L-Q32 was implemented 
in an earlier version of META which did not have 
the generator language. The subroutines required 
to implement SYNTAX and GENERATOR were written 
in MOL-Q32. This resulted in the entire META 
system being described in its own language and 
able to compile itself. This process was actually 
performed by bootstrapping from a more primitive 
version of the system. 

A MOL for SDC's S/360 System was designed and 
a compiler for it was described in META. The 
GENERATOR portion of META was changed to produce 
IBM 360 code, instead of Q-32 code, from the 
internal model. The resulting version of the 
META system for the 360 is currently being checked 
out. The bootstrap process to move from one 
computer to another is very well defined using 
this method. 



The META5 system has been useful in describing 
POL- to- POL translations (p. 1-5), data base con- 
versions, some analysis off a subset of English 
as used in questions, etc. Specifically, JTS-to- 
JS, TINT-to-JTS, J3- to- JS , FORTRAN- to- JTS, and 
JS- to- PL/I conversions have been accomplished. 



Various data bases have been converted for the 
LUCID system (p. 4-7). The META5 system has also 
been used to reformat KETA5 programs and to write 
the META5 compiler. A calculation program and an 
input processor for JOVIAL constants have been 
written in META5. The last two uses were of 
particular interest since they were written for 
on-line interaction between the user and the 
computer. 

As the META5 system became a working tool for 
POL- to- POL translation, the META5 language was 
expanded to facilitate string and character 
manipulation. The META5 system is currently 
being moved to the S/360 computer and, since the 
META5 language is totally machine independent, 
programs written for Q-32 META5 will run with no 
modification on the. 360. The J3-to-JS translator 
was used to bootstrap the META5 system over to 
the JS/ 360 dialect off JOVIAL. 



Plans 

While the META5 system is being moved to the 
S/360, improvements will continue to the language 
and system as needed. Tree-building and manipu- 
lation capabilities may be added along with 
multiple input/output capabilities, to permit 
the production of compilers. Debugging capabili- 
ties will be made available. Study will be made 
of the feasibility off partial recompilation of 
META5 programs. More POL- to- POL translators will 
be written including a PL/I-to-JS translator. 

The META system is also being moved to the 
S/360. When this is completed, improvements will 
be made. One planned improvement is the addition 
of a feature to describe and handle a dictionary 
containing information about data types. The 
GENERATOR language will continue to be developed. 
After this, work will split into two directions: 
(1) further investigation of metalanguages and 
processors, and (2) the use of META to produce 
compilers and interpreters for various non- 
procedural languages for experimentation. 
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Project Documentation 

1. Oppenheim, D. K. The META5 language r.nd 
system. SDC document TM-2396'. July 21, 1965. 
49 pp. 

2. Book* E. and Schorre , D. V. A higher- level 

machine-oriented language as an alternative 
to assembly language. SDC document TM-3086/ 
001/00. August 12, 1966. 29 pp. 

3. Oppenheim, D. K. and Haggerty, D. P. META5: 

A tool to manipulate strings of data. 
Proceedings of the 21st National ACM 
Conference . August 1966, 465-468. (Available 
as SDC document SP-2243/000/01. ) 

Trans lat ion Be tween Proc edure - 
Oriented Languages 

D. P. flags*' rty 



Description 

A major difficulty hindering the introduction 
of improved procedure-oriented languages (POLs) 
into an established computing facility is the 
incompatibility between the new or improved 
languages and older languages long in use. It is 
often desirable to modify old programs by using 
the new language or to combine routines written 
in the old language with routines written in the 
new one. One method of minimizing this difficulty 
is the design of translation programs between old 
and new POLs. 

Tills project is exploring the possibilities and 
limitations of automatic translation of one POL 
into another. 

Several programs designed to accomplish a 
translation from one POL to another existed at 



The languages selected for translation in this [ 

particular study were JOVIAL (JS version) and ^ 

PL/I (i.e. , a JS-to-PL/i translator) and the META5 
system was selected to realize the translator. 

The language pair was chosen because both would l 

r 

be available on IBM System/360 and a POL- to- POL | 

translator would have practical value The META5 
technique was selected because a TINT-to-JTS | 

translation was partially implemented at the time 
and seemed to indicate that the method could be 
applied to the more complex JS-to-PL/I translation. 

Progress | 

i ! 

At present, the translator is operating on the h 

0-32 under the Time- Sharing System. 

The translator was written in the META5 language 
and debugged on-line under the Time-Sharing System. 

The use of the META5 language facilitated the | 

writing of the translator in several ways; the f. 

specification of the syntactic recognition process if 

for the various JS forms is not only precise and j$ 

transparently explicit but also succinct. The t 

\, 

transformational processes necessary to produce 

the PL/ I equivalents are also expressed quite 

clearly and briefly. < 

The language specifications as a basis for 1 

writing the translator were, for JS: SDC TM-1682/ jP 

003/00, JOVIAL (J-6) Grammar and Lexicon; and for 
PL/I: IBM, File No. S360-29, FORM C28-6571-3, | 

IBM System/360 Operating System PL/I Language 
Specifications. jp 
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the inception of this project — an SDC program to 
translate FORTRAN II to JOVIAL (J2) and an IBM 
FORTRAN II- to- FORTRAN IV translator. Two others 
were being programmed- -a TINT-to-JTS translator 
and a JTS-to-JS translator. 

The first two translators were produced using 
"traditional" methods, i.e. , they were written 
in the conventional programming languages J2 and 
FORTRAN, respectively; in contrast, the latter 



Although it is assumed that the JS input text 
has been found syntactically correct by the 
‘generator* phase of a JS compiler, the philosophy 
adopted with respect to the translator has been 
to produce as complete a translation as possible 
no matter how incorrect or garbled the input text 
may be; hence non-JS program segments will produce 
output, e.g. , J3 programs will translate although 
usually incompletely. 



I 

I 



two translators were written in META5, a syntax- 
directed compiler-writing system (see page 1-3). 



The translator has converted a JS program that 
was designed to test the ability of a JOVIAL 
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compiler co compile a JS program. The PL/ I 
equivalent* of this program has been submitted to 
IBM for syntax-checking by that part of the PL/ I 
compiler; the program consisted of about 315 
statements and 120 data declarations. A much 
larger J3 program (about 1800 statements and 250 
data declarations) has also been translated; this 
translation should give an indication of how 
useful the translator is for translating J3 
programs (for which it is not designed), 
i.e. , how difficult the postediting job is. 



Compiler Techniques for Paging * 

R. J. Dinsmore 

Description 

The goal of this project is to establish the 
compiler techniques for producing computer code 
that takes advantage of "page" and "segmentation" 
features of new computers such as the IBM 360/67 
and the GE 645. 

When the current generation of compilers was 
designed, computer memories generally consisted 



► 



> 



Several PL/ I translations have been compiled 
by the F- level compiler and have revealed errors 
in the translator. 

The project has noted several aspects of pro- 
gramming languages that add to the difficulty of 
translation (these remarks also apply to the 
compilation process): (I) any ad hoc devices 

that a particular compiler uses to "fit" a 
language to a machine constrict the translation 
process; (2) the existence of structural con- 
nectivity (see E. T. Irons , "Structural 
Connections in Formal Languages," Communications 
of the ACM . Vol. 7, No. 2, pp. 67-72, for an 
exposition of this concept) in a language adds 
a considerable burden to a translator, and hence 
should not be introduced unnecessarily. 

Plans 



of a single block of locations whose addresses 
were simply a continuous sequence from zero to 
the highest available. The new computers, however, 
subdivide memory into sma? ’• r blocks called 
"pages" and "segments," and it appears that 
special compiling techniques are needed to properly 
utilize these features. That is, compilers must 
produce computer programs that will take advantage 
of the small blocks of memory and minimize the 
number of times a process is interrupted for the 
loading of new pages or segments. 

Other SDC projects (Automatic Code Improvement 
and Automatic Flow Charting) have shown that the 
generator phase of a compiler can obtain a great 
deal of information about the structure of a 
program, and it is anticipated that this is the 
kind of information useful in the paging problem. 



' 1 



'H ; 



The META5 system is being converted to the IBM 
System 360. When this Is completed, study Of 
translation techniques will continue through the 
application of META5 to other language pairs. 

In particular, a PL/I- to- JOVIAL translator offers 
a promising research avenue. 

Project Documentation 

1. Haggerty, D. P. Use of the JS to PL/I 

translator. SDC document TM-2823. January 19, 
1966. 4 pp. 

2. Haggerty, D. P. JSPL: An application of the 

META5 system. SDC document TM-3003. June 14, 
1966 . 18 pp. 



Progress 

The work plan on this project consists of a 
study of an existing compiler, the construction 
of a compiler that includes the new techniques, 
and a study of gains realized. 

This project was instituted * ite in the year 
with a study of the JOVIAL compiler for the 
IBM 360 and some of its output code. A number 
of possibilities for structuring a program have 
been hypothesized and are being considered 
for implementation in a compiler. Also the 



Supported by the Advanced Research Projects 
Agency. 
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differences in design between existing compilers 
and a paging compiler are being investigated. 

In addition to the program structure itself, 
two additional techniques are being considered. 
One of these is to have the program utilize 
statistics on its own past behavior in loops and 
at branch points. The other technique is to have 
the program furnish notice of its future page 
needs to the executive,, 

Plans 

Work will continue to find more potentially 
profitable ways in which programs can be 
structured. An experimental compiler will be 
built to test each of these techniques. 

PL/I for SDC 360 Time-Sharing System 
W. E. Meyer 

Description 

The purpose* of this project is to investigate 
the difficulties involved in moving a compiler 
from one third- generation operating system to 
another. Specifically, the project is investi- 
gating the transfer of the PL/I F- level compiler 
from the IBM OS/ 360, a system designed for 
multiprogramming, to the SDC 360 Time-Sharing 
System. 

The dependence of compilers and assemblers on 
operating systems has been increasing over the 
last ten years. This increasing dependence 
implies greater difficulty in transferring 
compilers from one system to another. With the 
advent of multiprogramming systems, this depen- 
dency is increased substantially because of the 
desirability of dynamic relocation, standard data 
structures, and flexible and dynamic linking of 
program segments. 

Because of the widespread interest in PL/ I and 
because of its potential usefulness in the SDC 
Time-Sharing System, PL/I was chosen as the 
object of this study. It was written to operate 
with the IBM OS/ 360; the SDC 360 System is 



sufficiently different that the transfer require- 
ments are significant. 

Progress 

A list of probable problem areas was developed. 
They are: 

1. The compiler interface with the operating 
system. 

2. The compiled program's interface with the 
operating system. 

3. The form of the compiled program as output 
by the compiler. 

4. The form of data aggregates (e.g., files, 
records, data sets, control blocks, etc.) recog- 
nized by the operating system and the compiler. 

5. The differences in input/output procedures; 
this is related to 4, above. 

6. Dynamic (execution time) calls on the 
library. (This may be unique with OS/360.) 

7. Possible side effects introduced in moving 
support programs from one operating system to 
another. 

In the latter part of the year, Program Logic 
Manuals for the PL/I compiler and the FL/I library 
were received from IBM, and a study of the compiler 
and the library was begun. All compiler inker- 
facing with OS/ 360 is contained 1 in six control 
programs. All references to OS/360 in these 
routines have been identified. There are 62 
calls, generated by 19 different system macros. 

This list of references will be used as a basis 
for the evaluation of the problem area 1, above. 

With respect to the object program's interface, 
the PL/I compiler does not generate code that is 
operating-system dependent; all operating system 
interfacing is isolated in the PL/I library. 

There are 17 library routines using 20 system 
macros that supply the object code interface 
with the operating system. These macros are, 
for the most part, the same as those used in the 
compiler-0S/360 interface. 

The form of the output program is not a problem 
in the case of PL/ I since the "object module" that 
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is output must be processed by the linkage editor, 
which outputs a program in a form readable by the 
loader. The need for link-editing arises because 
all memory allocation, data management, data 
conversion, and input/output processes have bean 
placed in the PL/I library and the library 
routines must be linked to the object program 
by the linkage editor. 

Plans 

The study of the problem areas will be con- 
tinued. The variances between the OS/ 360 and 
the SDC 360 System data aggregates will be 
tabulated and the differences between the two 
system interfaces will be evaluated. 



PROGRAMMING LANGUAGES 



LISP 2* 



SDC; J. A. Barnett 

Project Leader 

D. C. Firth 
R. E. Long 

E . Book 

R. E. Martin 
C. Weissman 

S. L. Kameny 

Description 



III: L. Hawkinson 

M. I. Levin 
P. W. Abrahams 

D. Crandal 
R. A. Saunders 

E. C. Berkeley 



The LISP 2 Project is a joint development of 
SDC and Information International, Inc. 



LISP 2, which is based on LISP 1.3, is a new 
programming language for manipulating complex 
data structures and performing lengthy arithmetic 
calculations. As in LISP 1.5, programs can be 
treated as data, and storage can be regained 
through a compacting technique known as "garbage 
collection." The LISP 2 Source Language (SL), 
which resembles ALGOL, is the standard input; 
the LISP 2 Intermediate Language (IL), which 
resembles LISP 1.5, is used for programs that are 
to be treated as data. Type declarations are 
available for efficient compilation of arithmetic 
operations. LISP 2 includes bit operators and an 
open subroutine capability. The most general form 



♦Supported by the Advanced Research Projects 
Agency. 



of a datum is a symbolic expression; other forms 



include numbers, functions, strings, and integer- 
indexed arrays. All of the system programs are 



themselves written in LISP 2. The 1/0 package 
transforms input into a stream of characters which 
are converted into tokens by the finite-state 



machine. The supervisor controls the various 



LISP 2 operations. SL is translated into IL by 
the syntax translator; IL is translated into 



assembly language by the compiler; and assembly 
language is translated into machine language by 

the LISP 2 assembler, LAP. Machine mobility is 
achieved through core image generation. (See 
Figure 1-1.) 



Presently implemented on the Q-32 computer, 

LISP 2 has two components: the language itself, 

and the programming system in which it is embedded. 
The system programs that define the language are 
accessible to and modifiable by the user; thus 
the user has an unparalleled ability to shape the 
language to suit his own needs and to utilize 
parts of the system as building blocks in con- 
structing his own programs. 

While it provides these capabilities to the do- 
it-yourself programmer, LISP 2 also provides the 
complete and convenient programming facilities 
of a ready-made system. Typical application areas 
for LISP 2 include heuristic programming, algebraic 
manipulation, linguistic analysis and machine 
translation of natural and artificial languages, 
analysis of particle reactions in high-energy 
physics, artificial! intelligence, pattern recog- 
nition, mathematical logic and automata theory, 
automatic theorem proving, game playing, infor- 
mation retrieval, numerical computation, and 
exploration of new programming technology. 

The LISP 2 programming system provides not only 
a compiler, but also a large collection of run- 
time facilities. These facilities include the 
library functions, a monitor for control and on- 
line interaction, automatic storage management, 
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FIGURE 1-1. LISP 2 SYSTEM COMPONENTS AND INFOBMATION FLOW PATHS 
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and communication with the monitor system of the 
machine on which the system is operating. 

A particularly important part of the program 
library is a group of programs for bootstrapping 
LISP 2 onto a new machine. (Bootstrapping is 
the standard method for creating a LISP 2 system 
on a new machine.) The bootstrapping process, 
achieved by the technique of generating a complete 
LISP 2 core image on the target machine (core 
image generation), represents a major milestone 
In the implementation of large systems. The LAP 
assembler, a part of the core image generator, 
produces relocatable binary code. During the 
bootstrap process, this code is assigned an 
absolute core address and relocated as necessary 
as if it were being loaded. Associated data 
structures are assigned locations and their binary 
core images generated. The code, data structures, 
and binary images are then placed on an external 
output device such as magnetic tape. The boot- 
* strapping capability is sufficiently powerful so 
that the new machine requires no resident programs 
other than the standard monitor and a non - 
reloca table binary loader to read the tape 
produced. 

LISP 2 includes and extends the capabilities of 
its ancestor, LISP 1.5, which is notable for its 
mathematical elegance and symbol-manipulating 
capabilities. However, LISP 1.5 lacks a con- 
venient input language and efficiency in the 
treatment of purely arithmetic operations. 

LISP 2 was designed to maintain the advantages 
of LISP 1.5 while remedying its deficiencies. 

The first major change has been the introduction 
of two distinct language levels: Source Language 

(SL) and Intermediate Language (IL). The two 
languages have different syntaxes but the 
same semantics (in the sense that for every SL 
program there is a computationally equivalent IL 
program). The syntax of SL resembles that of 
ALGOL 60 while the syntax of IL resembles that of 
LISP 1.5. IL is designed to have the same 



structure as data, and thus to be capable of being 
manipulated easily by user (and system) programs. 

An advantage of the ALGOL-like source language is 
that the ALGOL algorithms can be utilized with 
little change. 

The second major change has been the introduction 
of type declarations and new data types, including 
integer- indexed arrays and character strings. At 
a future time, packed data tables, which can 
presently be simulated through programming 
techniques, will be added. Type declarations are 
necessary to obtain efficient compiled code, 
particularly for arithmetic operations, but by 
using the default mechanisms (whereby the system 
automatically makes type declarations), a pro- 
grammer may omit type declarations entirely 
(albeit at the cost of efficiency). 

Figure 1-2 shows an example program, called 
RANDOM, in its Source Language format , in the 
equivalent Intermediate Language program as 
produced by the syntax translator, and in the LAP 
output from the compiler. 

Progress 

A first LISP 2 system was implemented on the 
Q-32 and demonstrated in May of 1966. The 
LISP 2 Intermediate Language (IL) was used for 
all programming of LISP 2 including the system 
primitives; IL was found to be powerful enough 
so that little or no machine language or assembly 
language code was required. A few minor changes 
were made in IL on the basis of experience 
obtained by project personnel in using IL. The 
system was produced through the core image gen- 
eration process using LISP 1.5 on the Q-32 as the 
bootstrapping vehicle. A syntax translator for 
translating from source language to IL was written 
using metacompiling techniques (see page 1-3). 

A simple pattern matching routine and a LISP 
"pretty print" have been programmed in LISP 2 
to aid in system checkout. Also, several 
extensions to LISP 1.5, including a context 
editor, were required. 
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<S0URCE LANGUAGE* 



REAL SECTION TEST: 

7.R RANDOM COMPUTES A RANDOM NUMBER IN 
%R THE INTERVAL (A,B) 

DECLARE (Y-l) INTEGER OWN Y: 

REAL FUNCTION RANDOM (A, B): 

DO Y~3125*Y; 

Y~Y\67 108864; %R \ DENOTES REMAINDER 
RETURN A+Y*(B-A) /67 108864.0; 



E14D; 



^INTERMEDIATE LANGUAGE* 



(SECTION TEST REAL) 

(DECLARE (Y INTEGER OWN 1)) 

(FUNCTION (RANDOM REAL) ((A REAL) (B REAL)) 

(BLOCK () 

(SET Y (TIMES 3125 Y» 

(SET Y (REMAINDER Y 67108864)) 

(RETURN (PLUS A (TIMES Y 

(QUOTIENT (DIFFERENCE B A) 67108864.0)))))) 



<Q-32 LAP LANGUAGE. 



(LAP 



(FUNCTION ((RANDOM . TEST) REAL) 

((A REAL LEXICAL VALUE) (B REAL LEXICAL VALUE)) 
(STF TOP.) 

(BEGIN) 

(LDA (Y . TEST)) 

(MUL 3125 (L567.7 R S» 

(ARCS) 

(STB PUSHA.) 

(LDA (NUMBER 67108864) S) 

(GALL (REMAINDER . LISP)) 

(STF (Y . TEST)) 

(LDC A) 

. (FAD B) 

(FDV (NUMBER 67108864.0)) 

(FMP (Y . TEST)) 

(FAD A) 

(END) 

(RETURN)) 

(((REMAINDER . LISP) FUNCTION 

(FUNCTIONAL INTEGER INTEGER INTEGER) VALUE) 
((Y . TEST) OWN INTEGER VALUE)) 

TEST) 



FIGURE 1-2. A SAMPLE LISP 2 PROGRAM IN SL, IL, AND LAP 
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Plans 

The Q- 32 LISP 2 system will be polished and a 
mechanism for swapping binary programs from disc 
storage will be Installed. An operational Q-32 
system will be completed in the first quarter 
of 1967. 

Implementation specifications for putting 
LISP 2 on an IBM 360-like machine will be written. 
Multiple register usage, virtual memory manage- 
ment, and paging techniques will be included in 
the design work. 

Several problem areas need to be resolved. 

The first is that LISP 2 must be polished to 
reduce its size. Also, the swapping mechanism 
has been designed to help alleviate this problem, 
particularly in view of future "page- turning" 
operating systems. The second problem involves 
finding a target system to which LISP 2 may be 
bootstrapped. Neither the IBM 360 nor the PDP-6 
time-sharing system has progressed fast enough 
to be used as an operational vehicle for LISP 2. 
Lacking a stable, reliable time-sharing environ- 
ment, mobility of LISP 2 onto other systems is 
being compromised. 

Pro lect Documentation 

1. Abrahams, P. W. (Ill), Weissman, C. (SDC) , 
et al. The LISP 2 programming language and 
system. SDC/III document TK-3163. 

September 26, 1966. 28 pp. 

2. Kameny, S. L. and Weissman, C. The Q-32 
LISP 1.5 mod. 2.6 system: Operating system, 
input-output, file, and library functions. 

SDC document TM- 2 3 37/ 103/00. April 11, 1966. 

27 pp. 

3. Levin, M. I. and Berkeley, E. (III). LISP 2 

primer. SDC document TM-2710/101/00 (Draft). 
July 15, 1966. 165 pp. 

4. Firth, D. C. and Kameny, S. L. Syntax of 

LISP 2 tokens. SDC document TM-2710/210/00. 
August 25, 1966. 11 pp. 

5. Kameny, S. L. (SDC) and Hawkinson, L. (III). 

LISP 2 intermediate language. SDC document 
TM-27 10/220/01. July 5, 1966. 56 pp. 

6. Barnett, J. A. SIM, an s-expression pattern- 

matching function. SDC document TM-2710/260/ 
00. June 29, 1966. 6 pp. 



7. Saunders, R. A. (Ill), Barnett, J. A. and 
Firth,- D. C. (SDC). The LISP 2 compiler. 

SDC document TM-2710/ 320/01. February 1, 1966. 
55 pp. 

8. Book ? E. The LISP 2 syntax translator. SDC 
document TM-2710/331/00. April 15, 1966. 

27 pp. 

9. Howard, M. V. Operating instructions for the 
LISP 2 supervisor in the LISP 2 core image. 

SDC document TM-27 10/510/00. October 14, 1966. 
5 pp. 



Data Base Oriented Programming Language * 

E. B. Foote 
R. G. Howard 

Description 

Data management systems, such as LUCID (see 
p. 4-7), have several features distinctly different 
from procedure-oriented compiler systems such as 
FORTRAN or JOVIAL. The first is the English- like 
restricted language of the former. This language 
is valuable because it is user- oriented and easily 
learned. However, it lacks certain power inherent 
in the procedure-oriented languages. The second 
difference— not as apparent to the user but equally 
significant — is tie organization of the data in 
storage. Although languages like FORTRAN and 
JOVIAL allow flexibility in what is stored, and 
in the hierarchical structure between data 
elements, they put the material away, in core or 
tape or disc, in a relatively rigid format. Data 
management systems like LUCID, on the contrary, 
have a very flexible format, dictated by the data 
itself. The goal of this project is to study the 
possibility of obtaining in one system the power 
of the procedure- oriented language and the capa- 
bility offered by the data structure used in the 
data base management system. 

Progress 

The language is being investigated in the 
context of SDC's Time-Shared Data Management 
System (TDMS)— (see p. 4-9). The first step has 



^Supported by the Advanced Research Projects 
Agency. 
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been to study the possibility of building a 
convenient bridge for the user between a data 
management system and a procedure- oriented 
language; that is, to enable a user who has 
built a data base using the manipulative features 
of TDMS to use the same data base for more exten- 
sive computations. For example, a user who has 
built and used an inventory of bomb components 
might wish to use linear programming to determine 
the most bombs he could build out of components 
in his inventory. Such a lengthy computation 
exceeds the capacity of TDMS. The user would 
need to use something like JOVIAL but would 
prefer not to have to convert his data base to 
a new format. 

The use of the TINT* system on the Q-32 computer 
was considered as the procedure-oriented language 
base since TINT is more readily modifiable than 
available compilers. Several new language forms 
were devised to carry data back and forth between 
the formats of TDMS and TINT. However, taking 
material from the tree structure used in TDMS 
into tabular format of TINT can result in wasting 
of a great deal of space, because the tree 
structure does not allow space for absent ele- 
ments while the table must allow space for the 
largest potential entry. Other approaches were 
considered and several appear promising. One of 
these is to add a facility to a compiler such as 
JOVIAL that would enable a user to get elements 
from a data base as he needs them. Then the 
user can perform computations on the elements 
immediately or construct tables based on his 
knowledge of the data. 

Plans 

The JOVIAL compiler for SDC’s S/360 Time- 
Sharing System will be studied to determine if 
it can be matched to the data retrieval package 
of the TIMS system. If this appears feasible, 



the compiler will be modified to furnish a vehicle 
for experiment and further development. 

AIDS TO PROGRAMMING 

Interactive Programming Support System* 

M. B. Bleier 
H. Bratman 
E. R. Clark 
J. S. Hopkins 

Description 

The goal of this project is to develop a pro- 
gramming support system that (1) takes advantage 
of the interactive capability of time-shared 
systems, (2) facilitates the programmers' work 
by providing an integrated capability in system 
language, information, and function, and (3) makes 
advantageous use of a tabular display in program 
composition, program editing, program debugging, 
and program and system documentation. 

Among the functions that will be provided in 
the system are: 

Program Composition: Original preparation of 

source- language program 

Alteration (temporarily 
or permanently) of 
source- language program 

Maintenance of program 
and data files 

Program Compilation: Grammar checking 

Partial recompilation 

Compilation of optimum 
object code 

Program Testing: Display of variables that 

are selected by symbolic 
name 

Symbolic alteration of 
values of variables 

Tracing 

Assistance in preparing 
and storing data for 
program parameter testing 

Display of variables 

controlled by conditional 
statements 




★Teletype INT erpreter--an on-line assembly- 
oriented language system. 



★Supported in part by Rome Air Development Center. 
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Documentation: Production of "set-used" 

information and glossary 
of names used in the 
program 

Reformatting of source- 
language program text 

Production of flow charts 
containing variable 
levels of program 
details 

Assistance to user in 
learning the system 

The system is designed to give the user an 
integrated language for the program production 
processes of composition, editing, compilation, 
execution, testing, and documentation. The 
commands for performing a particular action are 
all in the same format, even though the action 
may occur in different functions. For example, 
once a programmer has learned the commands to 
vinsert and delete statements in a program, he 
is able to perform simple kinds of file mainte- 
nance without learning new or contradictory 
commands. 

Also, in developing a program, the user will 
deal with only a single entity, the system itself, 
instead of being concerned with a compiler, an 
editor, a debug package, etc. He can perform 
actions as needed without awareness that difrerent 
functions are being used. For example, if the 
user gives a command to execute a program and 
that program has not been compiled, the system 
will compile it for him. The system is designed 
for programmers rather than problem solvers. 
However, the system and its language are designed 
to be learnable in steps, so that a novice with 
a simple problem would need to learn only a small 
part of the system. 

In addition to providing an integrated language, 
the system exploits the capability of time- shared 
interaction, especially during compilation. 

During compilation the compiler usually makes 
some arbitrary decisions. For example, when 
there are more index variables than registers, 
the compiler uses predetermined rules to assign 



registers. In an on-line, interactive situation, 
the compiler can ask the programmer for advice. 

If the programmer has information about the 
frequency of use of some or all of the indexes, 
the compiler can use this information to make 

4 

more efficient assignments. 

Another feature of the system is improved 
coordination of programs, providing better 
service to the user. Increased efficiency in 
operating time is expected, because programs 
within the system do not regenerate information 
that has already been computed. In particular, 
the compiler provides a great deal of information 
that is useful in debugging. An example ig the 
dictionary generated during compilation, which is 
currently being used by some debugging routines. 
The flow chart and glossary programs need much 
the same information about the structure of a 
program generated by a compiler with line-at-a- 
time or partial recompilation capability. 

Figure 1-3 shows the relationship between system 
components. 

Progress 

Investigation is proceeding along three paths: 

. Design of the system language and the 
techniques of user- system interactions. 

. Design of the system functions that support 
the system capabilities. 

. Design of system control and program 
integration. 

1. Design of the System Language . The conver- 
sational requirements of various components of 
the system, such as program composition, compila- 
tion, testing, and documentation are being 
studied. As the requirements are formulated, 
an integrated language is being developed. The 
development of the system design concurrent with 
the language design ensures that the system 
contains all the capabilities implied by the 
language. A document has been published that 
describes the system from a user's point of 
view 1 3 ]i . 
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FIGURE 1-3. COMPONENTS CF THE INTERACTIVE PROGRAMMING SUPPORT SYSTEM 
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2. Design of Various System Functions and 
Tables . A version of the syntax analyzer is 
being checked out on the Q-32. This component 
performs the following functions: 

. Analyzes syntax of all inputs to see if they 
are correct and indicates location of error if 
they are not. 

. If the input is a system command, encodes it 
in binary and searches the dictionary for any 
variable names in the command. 

. If the input is a JOVIAL statement, removes 
redundant blanks from the statement and makes 
entries in the dictionary for variables used or 
defined. 

The syntax analyzer is being written in JOVIAL 
and uses techniques developed by the META and 
META5 metacompilers (p. 1-3) . This will result 
in an efficient program that can be easily 
modified for changes in input syntax. 

Formats that facilitate the attaching and 
detaching of program units and system commands 
to the main body of the program have been 
proposed for the Text Structure Directory (TSD) 
and the Program Structure List (PSL). Algorithms 
have been devised to attach, locate, and trace 
program units in the proposed TSD format. A 
scheme to encode System commands, consistent with 
the proposed TSD format, has been devised. 

Research is being conducted in the area of 
partial recompilation, which combines the 
problems of program editing and incremental 
compilation with those of modifying executable 
programs. The program text is maintained in a 
list form to facilitate program editing. State- 
ments are compiled into independent subroutines 
whose computation sequence is described in the 
PSL. An execution monitor is being designed to 
operate on the output of the compiler. This 
component executes the independent program 
segments in the proper sequence and directs the 
operation of any program test requests made by 
the user. 



3. Design of System Control and Program Inte - 
gration . A central controller and input/output 
control are being designed to operate within 
SDC's S/360 Time-Sharing System. A study of the 
Time- Sharing System is under way to determine 
its optimum use with the Interactive Programming 
Support System. A prototype controller will be 
written to operate on the 360 with dummy system 
components . 

Plans 

Work on the Interactive Programming Support 
System will concentrate on completing the system 
design and building the framework cf the system. 
The aim will be to build a mockup of the system, 
with certain parts "running" and certain parts 
only represented, in order to meet the various 
constraints on resources. The emphasis will be 
on developing the system language and showing the 
forms of user interaction. The aim is to gain 
experience in using the system and testing the 
effectiveness of the program composition and 
editing language and function. 

The tasks to be worked on in the immediate 
future include: 

. Development of the system language. 

. Design and construction of the program 
composition and editing programs. 

. Completion and revision of the syntax 
analyzer. 

. Design and construction of the system control 
program. 

. Modification of the existing JS S/360 compiler 
to be operable under the control program, match 
the output of the editing program, provide some 
interactive capability, and match existing testing 
programs . 

Pro i e c t Documentation 

1. Jacobs, E. H. An interactive programming 
support system. SBC document TM-2819. 

January 1, 1966. 4 pp. 

2. Beeler, R. G. , Jackson, C. W. , Hopkins, J. S. 

and Jacobs, E. H. Interactive programming 
support system. SDC document TM- 28 19/00 1/00. 
February 21, 1966. 9 pp. 
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3. Bratman, H. and Hopkins, J. S. Initial design 
specifications for the interactive programming 
support system. SDC document TM-2819/002/00. 
July 20, 1966. 54 pp. 

Automated: Flow Charting 
!,. Fine 

Description 

The flow chart is a traditional programming 
tool used in program design, production, checkout, 
maintenance, and documentation. It provides a 
means for correlating what a program actually 
does with what it is meant to do. The level of 
detail desired in a flow chart is a function of 
its intended use. For some purposes a general- 
ized flow chart, giving an overview of a program, 
is most useful, while for other purposes a 
detailed flow chart, giving the exact flow of 
a program, is most appropriate. However, the 
production and maintenance of such flow charts 
is often in itself an overwhelming, and hence 
rather neglected, task. For this reason, a 
program that takes a symbolic program and from 
it automatically produces multilevel flow charts 
can serve an important purpose. In addition to 
relieving the programmer of the tedious aspects 
of flow diagramming, such a program produces 
flow charts that have a consistent degree of 
accuracy and a standardized format. 

This project is developing a method for auto- 
matically producing multilevel flow charts. The 
most detailed flow chart is first produced and 
then, by iteratively applying a set of algorithms, 
it is successively condensed to produce flow 
charts at various levels of detail. 

Progress 

As a start on the problem of analyzing symbolic 
code, a program (SURE) was written which would 
take a program written in JTS (JOVIAL for Time- 
sharing) and produce complete set/used infor- 



is, SURE can reformat a program so that it will 
be shorter and will execute faster. This work 
appeared sc promising that a separate project 
aimed at automatic code improvement was started 
(see p. 1 - 21 ) while the automatic flow charting 
work continued. 

.. method of producing multilevel flow charts 
has been implemented in JOVIAL. It is currently 4 

operating on the Q-32 computer, and will be 
adapted to operate on SDC's S/360 Time-Sharing , 

System. The program takes a source program 
written in a subset of JOVIAL, determines the 
function of each source program statement, and 
assigns to each statement (other than a branch- 
type statement) a separate box that indicates 
its function. Text that describes the action of 
the source statement is placed in each box, and < 

flow information is associated with each box. 

Finally, certain adjacent boxes are grouped 
together, thus producing the most detailed flow 
chart . 

A study of the number of entries to, and exits 
from, each box, as well as some information about 
program flow, indicated that certain configurations 
of boxes can be grouped together without essen- 
tially altering the picture of program flow. This ^ 

study led to the development of a set of seven 
rules that, when applied to the flow chart, 
condense it by grouping together certain config- 
urations of boxes and thus produce a more general- 
ized flow chart. By reapplying the rules to this 
flow chart the next- level chart is produced. The 
seven rules specify how different boxes are 
combined, how the text is modified, and how the 
flow information is updated. Initially, simple 
program structures are grouped together and then 
more complex structures are combined, until 
finally the flow chart is in its most generalized 
form. 



• ffSsI 1 



mat ion. This program was then augmented to 
perform more detailed analysis and was found to 
be useful in improving the symbolic code. That 



The method and the program that implements it 
are still in the developmental stage. The program 
checkout has been completed 1 I it very little 
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experimentation has been done. The results of 
applying tlje rules are encouraging but the rules 
do not reduce all source programs to a most 
generalized flow chart consisting of a single 
box. 

The output of the flow charting program is a 
printer-drawn flow chart. No attempt has been 
made to produce a well-drawn picture, since this 
project is primarily concerned with the analysis 
techniques. A number of researchers are working 
on the "drawing" problem and it is expected that 
this project will utilize their results. 

Plans 

The next step will be to generate a number of 
computer-produced flow charts to be used in 
evaluating and refining the method. A study will 
be made to see if the seven rules can be extended 
to reduce all source programs to one-box repre- 
sentations. Further investigation will also be 
directed toward determining what constitutes a 
flow chart level. 

The present output will be replaced with a more 
Sophisticated "picture- drawing" capability , 
preferably by adapting one of the flow charting 
programs already implemented outside of SDC, and 
an on-line display console will be used as an 
output device. Long-range plans include an 
investigation of user interaction with the 
program, a feature that is not possible now. 

Project Documentat ion 

1. Fine, L. Automated flow diagramming- -boxes 
and text. SDC document TM-2969/001/00. 

April 18, 1966. 6/ pp, 

2. Fine, I,. Automated multilevel flow 
diagramming. SDC document SP-2629. 

October 26, 1966. 19 pp. 



Graph 1 c Input /Output* 

.1. I. Bernstein 

Description 

The primary purpose of this project is to extend 
on-line programming facilities and techniques 
through the use of graphic input /output equipment- - 
namely, the RAND Graphic Input Tablet, CRT displays, 
and their associated hardware. The attainment of 
this goal requires the development of character 
and shape recognition routines that function effi- 
ciently in an on-line time-shared environment, and 
the integration of these routines into both current 
on-line programming systems and extended program- 
ming systems. To be useful, the character 
recognition routines must not only be able to 
recognize inputs from a large vaiiety of users, 
but must provide the users with a relatively 
large character set. The extended systems include 
languages for which the placement of characters 
is more meaningful and natural in a two-dimensional 
format; for example, in flow charts, automated 
analysis and manipulation programs, and theorem- 
proving and game-playing programs. 



The central ideas of the character recognition 
techniques being developed are that maximum use 
be made of the serial nature of the real-time 
input data and that the principal unit of infor- 
mation is the stroke. The program extracts 
descriptors in the form of "feature" strings 
from each stroke as it occurs. Multistroke 
characters require that the strokes constituting 
them have the proper spatial relations. 

In order to tailor the character recognition 
programs to a given user, they are being con- 
structed to allow each user to provide his own 
input vocabulary of characters and to associate 
these with the desired output character. During 
this procedure a control program constructs a 



★Supported in part by the Advanced Research 
Projects Agency. 
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dictionary for the user and allows him to test 
the level of achievement in recognition. If the 
user is unsatisfied with the results, he may add 
samples until the recognition rate is acceptable. 

To date, four feature extractors arid two 
methods of analyzing the spatial relations 
between strokes have been programmed for the 
Fhilco 2000 using a RAND Graphic Input Tablet 
for input and its associated display for output. 

One extractor generates a "feature" each time 
it detects a local minimum or maximum in the X 
or Y coordinates (i.e., eaeh time the writing 
instrument reverses direction of its left-right 
or up-down motion). Another divides the stroke 
area (the rectangle circumscribing the stroke as 
defined by the absolute minimum and maximum 
values of X and Y) into four equal areas upon 
completion of the stroke. The "feature" list 
consists of the subareas through which the stroke 
passed in order of occurrence. A third "feature" 
extractor divides the stroke area into five sub- 
areas by superimposing a diamond- shaped area at 
the center of the rectangle. The fourth extractor 
fits straight-line segments to the stroke path 
and measures local curvature in the path by 
computing the angular change between adjacent 
line segments. "Features" in this case are a 
function of the collected local angular changes 
that are all In the same direction of rotation 
(clockwise or counterclockwise). Collection is 
terminated by a change in direction of rotation, 
by a sharp angular change regardless of direction 
of rotation, or by the end of the stroke. The 
"features" themselves are classifications based 
upon the amount and direction of curvature. 

Of the two methods of analyzing the spatial 
relations between strokes, only one has proved 
useful to date. It classifies the relationship 
of a pair of consecutive stroke rectangles into 
one of three classes: coincident, proximate, or 

unrelated. To those classified as proximate or 
unrelated, a quantized direction is added. For 



this purpose direction is quantized into eight 
headings. 

The other analyzer, which generates the relation- 
ship between the imaginary line segments formed by 
joining the stroke end points, has been temporarily 
laid aside until more work can be done to provide 
better discrimination. 

The feature extractors and the first spatial 
analyzer have been integrated in a basic control 
package which handles the input/output chores and 
the construction of the stroke-character dictio- 
nary. The dictionary is a tree. All characters 
with common beginning strokes have the same path 
in the tree, no branch occurring until a dif- 
ference occurs. Thus, a "1," a "P," and an "R" 
occupy one path in the tree, with the "!" and the 
"P" as intermediate and defined sub-elements of 
an "R." 

Tests with each of these techniques showed a 
variety of shortcomings. 

The feature extractor based upon the detection 
of local minima or maxima in X and Y was overly 
discriminating and required too many samples for 
adequate recognition. The area feature extractors 
were unable to consistently discriminate between 
all the 99 characters in the test set. The ex- 
tractor based upon measurements of path curvature 
and rotation was not completed but appeared to 
be overly discriminating. 

Several techniq les for better smoothing and 
filtering were tried in order to improve the 
min-max extractor, in the belief that some of 
the problem was due to noise. This did not prove 
to be the case and further work on this extractor 
has been suspended. 

Using similar techniques on the curvature feature 
extractor showed that noise waa not its principal 
problem, but rather that there was some incon- 
sistency in determining the proper termination 
point for each feature in complex characters. 

In an attempt to solve this problem, a corner 
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FIGURE 1-4. GRAPHIC INPUT /OUTPUT EXPERIMENTATION 

Experimenter writes character on RAN© Tablet with stylus. The display scope shows the large 
hand-drawn character, plus the computer-recognized matching character immediately above it. 
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detector was developed that looked at both the 
local geometry and path velocity for clues in 
locating a corner. The corner detector proved 
successful and was incorporated in both the area 
feature extractor (using five subareas) and the 
curvature feature extractor. This addition 
enabled the area feature extractor to provide 
adequate recognition for at least one user on a 
set of 99 characters that includes the upper and 
lower case Roman alphabet, louer case Greek 
alphabet, 10 digits and 13 punctuation marks. 

The computer on which the work was being done 
(the Phileo 2000) was turned off before completion 
of modifications and testing could be done on the 
curvature extractor. 

Plans 

In an attempt to smooth the transition from 
the Phileo 2000 to the IBM S/360, some RAND 
Tablet outputs of drawn characters were recorded 
on tape and converted to IBM format. This tape 
will permit preliminary work to proceed on the 
S/360 without full hardware capability and will 
provide a controlled mechanism for program 
improvements and testing. 

Both the curvature and area feature extractors 
will be programmed for the IBM S/360 to work 
under the SDC Time-Sharing System, initially 
incorporating the most obvious improvements. 

When the programs are brought to the appropriate 
state, the existing versions will be tested for 
various capabilities, with a typical set of 
potential users. Work will also begin on 
Imbedding the recognizer in a system that allows 
a programmer to prepare a computer routine di- 
rectly on the RAND Tablet. 

Project Documentation 

1. Bernstein, M. I. An on-line system for 

utilizing hand-printed input. SDC document 
TM-3052. July 11, 1966. 19 pp. 

2. Bernstein, M. I. Some system considerations 
for on-line character recognition in a time- 
sharing system. SDC document SP-2636. 

October 28, 1966. 11 pp. 



Automatic Code Improvement 
E. R. Clark 

Description 

Almost any program of a certain minimal com- 
plexity can be improved 1 . Improvement may mean 
making the program shorter in storage space 
required, faster in execution time, or rearranged 
so that the program logic is easier to understand. 
Undoubtedly, these improvements can be done most 
effectively by an experienced programmer who 
understands what the program is to do. Since 
such people are not always available, and, even 
if they are, the process is time consuming and 
expensive, a processor has been written to 
analyze a program automatically and attempt to 
improve it. The processor needs no understanding 
of what the program is supposed tc? do; its object 
is to produce a better program that will do the 
same thing. 

Progress 

A program called SURE (Set-Used RE formatter), 
has been developed which accepts a program 
written in JOVIAL, automatically looks for any 
of 11 specific situations, and improves the 
program if any of these are found. The particular 
methods implemented were selected by weighing the 
difficulty of detecting and improving a given 
situation against the likelihood that such a 
situation will occur. The improx'ements made in 
a well-written program may not be very signifi- 
cant, but the processing time of SURE is not 
great, and the processed program is permanently 
improved since the changes are made at the 
symbolic language level. A poorly written 
program can, of course, benefit even more. 

The original version of SURE generated an inter- 
mediate language which was processed for possible 
improvements. SURE was rewritten to make its 
improvements by working directly on the symbolic 
statements. This version makes the same improve- 
ments as before but, not requiring an intermediate 
language, is about one- third shorter. SURE 
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originally accepted the JTS version of JOVIAL as 
input. It now accepts the full J3 language in 
addition to the special primitives belonging to 
JTS. 

Plans 

SURE will be modified to run under SDC's S/360 
Time-Sharing System. Additional methods of 
automatic improvements might be implemented if 
the need for a particular improvement is demon- 
strated. 

Pro iee t Doeumentat ion 

1. Clark, E. IS. Executive service: Set /used 

reformat processor (SURE). SDC document 
TM- 2 708/ 205/00. April 7, 1966. 13 pp. 

2. Clark, E. U. On the automatic simplification 

of source- language programs. SDC document 
SP-2389. April 11, 1966. 20 pp. 

COMPUTER PROGRAMMING MANAGEMENT 

Program Cost Analysis * 

T. Fleishman 
V. LaBolle 
IS. A. Nelson 
6. F. Weinwurm 

H. J. Zagorski (Defense Systems Division) 
Description 

The aim of this project is to develop techniques, 
standards, and guidelines for managers of. computer 
programming projects, to aid them in planning, 
controlling, and estimating the costs of computer 
programming Jobs. This work has been character- 
ized by three major steps, namely: (1) the col- 

lection of data that detail the costs and cost 
factors for completed programming jobs by means 
of a questionnaire, (2) the validation of these 
data to eliminate errors in the data, and (3) the 
use of statistical techniques, e.g. , multivariate 
regression, to derive numerical relationships, 
primarily linear equations, for estimating the 
costs of proposed computer programming projects. 

^Supported by the Air Force Electronic Systems 
Division, Directorate of Computers. 



Progress 

Using the foundation provided by earlier results* 
from an analysis of data on 74 programming projects 
completed at SDC, project members did additional 
work to derive estimating equations that were both 
easier to use and more accurate. To make the 
equations easier to use. the logarithmic trans- 
formation adopted earlier for some variables was 
dropped- -especially for the major cost measures 
used as dependent variables, i.e., man-months, 
computer hours, and elapsed time in months. As a 
result, a derived statistic cuch as the standard 
error of estimate for man-months could be measured 
in those units rathar than in their log, and the 
meaning of the results could be readily inter- 
preted. To improve the accuracy of the equations, 
seven data points with excessive cost measures 
were eliminated. This truncated sample was used 
to derive equations that reduced the strong 
influence of these outliers. To further increase 
the statistical precision, the remaining total 
sample (N *• 6?) was divided into three subsamples 
based on the program size measured in man-months. 

Concurrent with this work, an effort was under 
way to validate additional data collected earlier 
to form a more homogeneous and representative 
sample from sources other than SDC. These new 
data were to be merged with the SDC data to form 
a larger data base that would be analyzed in 
similar ways. The new analysis was to stress 
sub samp ling as a way to derive more accurate 
equations. 

In the latter part of 1965, over one hundred 
data points from completed programming efforts 
were collected from 8 industrial organizations 
and .4 U. S. Air Force agencies. After authentica- 
tion of these data in the spring of 1966, the newly 
acquired data points were combined with the SDC 



*See SDC document TM-2712, Research into the 
Management of Computer Programming: A 

Transitional Analysis of Cost Estimation 
Techniques, November 12, 1965. 
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data on hand, to form a total data base of 169 
data points as inputs to the analysis. 



Four subsamples were analysed as meaningful 
ways to divide the data in terms of costs: 

(1) Programming Application- -a division of 
programs into categories- -Business, Scientific, 
Utility and Support, and Other (a miscellaneous 
category such as ccmnand and control, research 
and development); (2) Program Source Language, 
a separation between programs written in machine- 
oriented languages and procedure-oriented lan- 
guages; (3) Production Computer Size, a division 
based on equivalent purchase price of small, 
medium, and large computers; and (4) Stand-Alone 
versus System Program, "one-shot" programs 
produced as single entities versus programs 
created as integral parts of a larger information 
processing system. 



Within each of the four groups, statistical 
tests of significance were performed on the 
means of the following variables: Man-Months, 

Computer Hours, Elapsed Time in Months, Number 
of Object Instructions, Source and Object Pro- 
duction Rate 8 (Instruct ions /Man- Months), and 
Source and Object Computer Usage Rates (Computer 
Hours/1000 Instructions). The results of these 
tests on the data (see Table 1-1) showed the 
following: 

. Utility and Support programs are more costly 
to produce than the other three applications. 

. Business (file-oriented) programs are less 
costly than the three other program applications. 



TABLE 1-1. MEAN COSTS, PRODUCTION RATES, AND 

COMPUTER USAGE RATES BY APPLICATION 



Application 


Number 

of 

Points 


Mean 

Man- 

Months 


Mean 
Cmptr 
Hr s 


Mean 
Object 
Instr /Man* 
Month 


Mean 
Computer 
Hrs/1000 
Objec t 
Instr i 


Business 


79 


13.2 


73 


1521 


12 


Scientific 


1 27 


42.0 


137 


882 


18 


Utility and 
Support 


28 


92.7 


766 


410 


57 


Other 


35 | 


54.7 


263 


292 


30 



. Procedure-Oriented Languages are more effec- 
tive (see Table 1-2), i.e., have lower resource 
use, object instructions, and computer usage 
rates, than Machine-Oriented Languages. 

. The average expansion ratio is approximately 
3.3 Machine Language Instructions to one Procedure- 
Oriented Language Instruction. 



TABLE 1-2. MEAN COSTS, PRODUCTION RATES, 
AND COMPUTER USAGE RATES BY 
PROGRAMMING LANGUAGE 



Application 


Number 

of: 

Point 8 


Mean 

Man- 

Months 


Mean 
Cmptr 
Hr s 


! 

Mean 
Object 
In str /Man- 
Month 


I Mean 
Computer 
Hrs/1000 
Object 
1 Instr 


Machine- 

Oriented 


123 


48 1 


289 


610 


30 ! 


Procedure- 

Oriented 


46 


18 | 


99 


1977 I 


1 10 



The data for the other two subsamples, Computer 
Size and Stand-Alone/System, showed no conclusive 
result 8. 



Several sets of estimating equations were derived 
for the total sample and for a number of subsamples. 
These results, as well as others, were summarized 
in a handbook for estimation of computer program- 
ming costs. The equations for the subsamples 
presented in the Management Handbook ft 3] exhibit 
better statistical precision than the equations 
for the total sample. 

The Management Handbook also contains other 
material based upon technical literature and the 
experience of project members. These guidelines 
are intended to help managers estimate the cost 
of computer program development. In the handbook 
the computer programming process is divided into 
six categories: Preliminary Planning and Cost 

Evaluation; Information System Analysis and 
Design; Computer Program Design, Code, and Test; 
Information System Integration Test; Information 
System Installation and Turnover; and Computer 
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Program Maintenance. Each of these process 
steps is described in terms of tasks , inputs, 
and outputs. For each step a number of cost 
factors are listed together with some statistical 
and/or intuitive indication of their influence 
on costs. Planning factors such as unit costs 
or percentage- of -other- item costs are also given 
for the process steps. Examples of forms for 
recording cost estimates are also included. 

This Handbook should be interpreted as an 
initial effort to present the manager with a 
comprehensive set of applicable guidelines. 

If feedback indicates that guidelines in this 
form are useful the Handbook should be sup- 
plemented and revised as more information becomes 
available through further research and develop- 
ment . 

Plans 

No further analysis of the collected data is 
scheduled at this time; however, the analytical 
results to date and the data ba?e could be used 
in future project endeavors, such as the one 
planned for the first part of 1967, namely, the 
continued development of an operational system 
to collect cost data during the program develop- 
ment process. 

Pro 1 ect Documents t ion 

1. LaBolle, V. Development of equations for 

estimating the costs of computer program 
production. SDC document TM-2918. April 5, 
1966. 49 pp. 

2. Fleishman, T. Current results from the 
analysis of cost data for computer programming. 
SDC document TM- 3026/000/01. July 26, 1966. 

97 pp. 

3. Nelson, E. A. Management handbook for the 
estimation of computer programming costs. 

SDC document TM-3225. October 31, 1966. 

141 pp. 

4. LaBolle, V. Development of aids for managers 
of computer programming. Journal of Industrial 
Engineering . 1966, ,17 (11), 564-571. 



A System for Reporting Cost Data for 
Computer Programming* 

L. Farr (Advanced Systems Division) 

T. Fleishman 

V. LaBolle 

E. A. Nelson 

C. L. Starkey (Defense Systems Division) 

G. F. Weinwurm 

Description 

In the work to develop estimating equations for 
computer program development costs (see p. 1-22), 
one major difficulty was obtaining data that 
relate products to costs. Even when these data 
were available, they showed the poor quality of 
"after- the- fact" data- -unstructured , ambiguous, 
and disorganized. Recorded data were aimed mainly 
at organizational or contractual accounting and 
not at planning and control of computer program- 
ming projects. The reporting system being 
developed is intended ror use during a computer 
programming project. In addition to promoting 
the recording of uniform data that can be 
compared from project to project, the system 
should provide information for cost control of 
individual projects and inputs to a data bank 
that can be analyzed to supply improved planning 
factors. 

Progress 

The initial effort in the development of a 
reporting r'Stem was completed during the first 
part of 19 j. This work used a definition of 
the computer programming process as a context in 
which to identify and define proposed data elements 
to be collected in the system. 

The programming process was broken down into the 
following seven steps: Information Processing 

Analysis; Information Processing Desi*- ■»; Computer 
Program Design; Computer Coding and checkout; 
Computer Program Functional Test; Information 
Processing Integration Test; and Information 
Processing Installation and Implementation. 

♦Supported by the Air Force Electronic Systems 
Division, Directorate of Computers. 
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Each step identifies a stage in which a subset 
of the proposed set of data elements is to be 
collected. 

Several forms were developed to collect the 
data at each process step. Separate forms were 
proposed for (1) collection of both cost data 
and technical data, (2) tracking of estimated 
and actual costs and cost factors through the 
life cycle of a computer programming project, 
and (3) formation of a quantitative history of 
resource expenditure patterns. 

The system is intended to provide a basis for 
collection of comparable (uniform) cost and tech- 
nical data from computer programming development 
projects whether performed "in-house" by the 
Air Force or by a subcontractor. 

The first version was designed to be compatible 
with existing management and budgetary systems, 
e.g. , the Program Budget, Cost Information 
Reports (CIR), and System Program Management 
Procedures, as described in the AFSCM 375 series. 
As part of a task to consult for the SAC Airborne 
Data Automation Project, members of the Program- 
ming Management Project made recommendations for 
data collection on computer programming work. 

Plans 

Review of the completed work by managers has 
suggested areas of potential improvements, such 
as clarifying the definitions of the cost and 
technical data items. The continued work is 
aimed at making these improvements. Plans for 
the early part of 1967 call for testing the 
feasibility of the system in an Air Force agency 
that is responsible for a large number of pro- 
gramming projects. 

Project Documentation 

1, Weinwurm, G. F. Data elements for a cost 
reporting system for computer program 
development. SDC document TM- 2934/000/02. 
July 6, 1966. 83 pp. 



2. LaBolle, V. and Fleishman, T. Programming 
management project consulting for the SAC 
airborne data automation project. SDC document 
TM- 3094 . August 18, 1966. 24 pp. 



COMPLETED STUDIES 

The following studies in the Advanced Programming 
area were completed prior to 1966 and are not 
described in this report. 

Compiler Construction Techniques 

1. Book, E. The LISP version of the META compiler. 

SDC /III document TM- 2710/330/00. November 2, 
1965. 11 pp. 

2. Book, E. and Bratman, H. Using compilers to 

build compilers. SDC document SP-176. August 
31, 1960. 11 pp. 

3. Book, E. , Bratman, H. , Schwartz, J. I. A one- 

pass JOVIAL compiler. SDC document TM-970/ 
001/00. January 22, 1963. 3 pp. 

4. Book, E., Bratman, H. , and Schwartz, J. I. 

J0VIAL-X.2, the language of the one-pass 
JOVIAL compiler. SDC document TM- 970/002/00. 
January 31, 1963. 11 pp. 

5. Cohen, V. L. and Hopkins, J. 5. Phase I of 

the JOVIAL generator (NGENi) . SDC document 
TM- 555/021/01. March 17, 1965. 343 pp. 

6. Dobrusky, W. B. Design for JOVIAL compiler 

for the small computer. SDC document TM-739. 
August 7, 1962. 9 pp. 

7. England, D. E. and Clark, E. R. The CLIP 
translator. Communications of the ACM , 1961, 

4, 19-22. (Also available as SDC document 
FN-4455.) 

8. Oppenheim, D. K. The META5 language and 
system. SDC document TM-2396. July 21, 1965. 
49 pp. 

Programming Languages 

1. Foote, E. B. and Sandin, N. A. JTS users' 
manual. SDC document TM- 157 7/000/01. April 

8, 1965. 63 pp. 

2. Isbitz, H. A formal description of CLIP. 

SDC document TM-543. October 14, 1960. 18 pp. 

3. Kameny, S. L. LISP 1.5 reference manual for 
Q-32. SDC document TM- 2337/101/00. August 

9, 1965. 85 pp. 

4. Kameny, S. L. Input-output file and library 

functions: The Q-32 LISP 1.5 mod. 2.5 system. 
SDC document TM- 2337/ 102/00. September 22, 
1965. 14 pp. 
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5. Perstein, M. H. The JOVIAL (J3) grammar and 
lexicon. St', document TM- 555/002/04. 

October 20, 1965. 138 pp. 

6. Shaw, C. J. A comparative evaluation of 
JOVIAL and FORTRAN IV. Automatic Programming 
Information . 1964, (22), 1-15. 

7. Weissman, C. LISP primer: A self- tutor for 
Q-32 LISP 1.5. SDG document TM- 2337/010/00. 

June 14, 1965. 166 pp. 

Computer Programming Management 

1. Farr, L. » LaBolle, V., and Willmorth, N. E. 

Planning guide for computer program development. 
SDC document TM-2314. May 10, 1965. 179 pp. 

2. Farr, L. Quantitative analysis of computer 
programming cost factors: A progress report. 

SDC document SP-2036. August 26, 1965. 19 pp. 

3. LaBolle, V. Office of naval research/computer 
program implementation process: Final report. 

May 1965. SDC document TM- 1954/004/00. 

June 3, 1965. 25 pp. 

4. Nelson, E. A. Research into the management of 
computer programming: Some characteristics of 
programming cost data from government and 
industry. SDC document TM-2704/000/00. 

November 15, 1965. 43 pp. 

5. Peach, P. Quality control for computer 
programming: A final report on an initial 
study. SDC document TM- 2313/001/00. 

September 9, 1965. 17 pp. 

6. Weinwurm, G. F. Research into the management 
of computer programming: A transitional 
analysis of cost estimation techniques. 

SDC document TM- 27 12/000/00. November 12, 

1965. 203 pp. 
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INFORMATION PROCESSING RESEARCH 



T. B. Steel, Jr. , Head 



The activities oi. cne Information Processing 
Research staff for the past year fall into three 
main categories. First, there are studies in the 
area of formal models of information processing 
concerned with the development and analysis of 
theoretical descriptions of various aspects of 
information processing systems. Second, there is 
an effort to study and develop information pro- 
cessing systems intended to promote more effective 
cooperation in man-machine teams in problem- 
solving contexts. Finally, some work is continu- 
ing in attempts to advance the frontier of support 
technology in the area of programming languages. 
This last work is reported under the topic of 
Advanced Programming (see p.l-1 et seq.). 

The importance and relevance of the first class 
of projects derive from the phenomenal growth of 
the information processing field. As a result of 
this growth, most developments in technique have 
been ad hoc and empirical; theory is virtually 
nonexistent. The history of science strongly 
Suggests that, unless the development of a 
theoretical framework begins to catch up with 
ad hoc studies, and organize them, progress will 
slow down. Thus, the principal objective of 
these projects is the pursuit of theoretical 
investigations into the nature of information 
processing procedures. While the means of inves- 
tigation may occasionally be empirical and parti- 
cular, the objectives remain theoretical and 
general. 



A common theme among this first set of studies 
is the development of abstract models covering 
machines, programming languages, procedures, and 



algorithms. Although the individual models are 
disjointed and sometimes even mutually contra- 
dictory, the employment of common research 
techniques is intended to lead to a rational 
attempt to locate integrating principles among the 
various theories. This latter endeavor, while 
not formulated as an explicit study, is an under- 
lying researca objective. 

The principal achievement of this research is 
the development of fundamental insights into 
information processing principles. In particular, 
continuing study of the relationship between the 
phrase- structure grammars developed by linguists 
and the procedure- oriented programming languages 
is important not only to a comprehension of the 
general structure of language, but to the improve- 
ment of ways in which languages of all kinds may 
be developed and processed. Additionally, the 
growing understanding of several models of data 
processing in terms of formal logic, in particular 
a model of question asking, shows promise of 
providing substantial tools for integrating a 
variety of information processing problems in a 
form susceptible to analysis. 

The second category of studies-- in the area of 
man-machine partnership — is evolving in a signifi- 
cant manner. In previous years these studies were 
a disparate collection whose principal connection 
was the fact that they were all concerned with 
"artificial intelligence." During this past year 
the Information Processing Research staff, 
together with representatives of other staffs, 
conducted a study into the meaning and utility of 
artificial intelligence research both in the 
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general research community and in the particular 
context of SDC's mission. The study concluded 
that "pure" artificial intelligence research was 
not of central interest to SDC's mission, and, 
indeed, was becoming a less attractive research 
area in general. However, exploration of the 
possibilities of man-machine interaction with the 
objective of augmenting man's intellect was not 
only of key importance to SDC, but also held 
promise of great general value and interest. 

As a result of this conclusion, together with 
the recognition that the skills and techniques 
required are similar to those previously being 
employed in the artificial intelligence area, 
a gradual redirection of this research is under- 
way. The change in title of the "Research in 
Adaptive Programming" project to "Problem Solving 
and Learning by Man-Machine Teams" reflects this 
movement . A new study, "Augmented Statisti- 
cian," is an example of a direct attack in the 
new direction. As time goes on, this new area 
will come to characterize this part of the staff's 
work. 

During 1966 several projects previously under 
investigation were terminated (see Completed 
Studies, p. 2-14) . One significant development 



has been the continuing withdrawal of SDC 
involvement in several projects where the principal 
investigator has been a consultant to SDC. In 
these cases, termination of SDC's participation 
has not meant a cessation of the research but 
merely a shift in the institutional arrangements 
for the study. 






In summary, during 1966 the Information Processing 
Research staff has reoriented its activities some- 




what toward an emphasis on problems more central 
to SDC's mission. It has also worked toward 
bridging the gap between research and application, 
both by better communication and by research 
emphasis. One of its chief aims continues to be 
the development of a science of information 




processing . 

Note: The work of several of the following 

members of, or consultants to, the Information 
Processing Research staff is described elsewhere 
in this report: 





3. Y. Sedelow, T. L. Ruggles - Stylistic Analysis 

(see under Language Processing & Retrieval - 
p. 5-17) 

D. P. Haggerty - Translation Between Procedure- 
Oriented Languages 
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W. E. Meyer - PL/I for SDC 360 TSS 

(see under Advanced Programming - p. 1-l et seq.) 
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FORMAL MODELS OF INFORMATION PROCESSING 

Theory of Ale orithmic Languages* 

J . Doner 

S. Ginsburg 

T. N. Hibbard 

G. F. Rose 

Consultants: S. A. Greibach, Harvard University; 

M. A. Harrison, University of California, 

Berkeley; E. H. Spanier, University of Calif- 
ornia, Berkeley; J. S. Ullian, Washington 

University 

Description 

A serious drawback in the application of modem 
data processing systems is the cost and time 
consumed in programming these complexes. The 
user's problems and their solutions are described 
in a language such as English. To use the 
services of a data processor, this descriptive 
language must be converted into machine language; 
that is, into program steps. In recent years, 
attempts have arisen to bridge gaps by construct- 
ing programming languages that are: 

1. Rich enough to allow a description of the 
solution of a wide range of problems. 

2. Reasonably close to the user's ordinary 
language of description and solution. 

3. Formal enough to permit a mechanical trans- 
lation into machine language. 

The purpose of this investigation is to accomplish 
the following: 

1 . Conduct research designed to develop a theory 
for algorithmic (programming) languages. 

2. Develop suitable mathematical models of 
currently used mathematical languages such as 
ALGOL, COBOL, and JOVIAL. 

3. Use the mathematical models to answer 
questions of interest about these languages. 



^Supported in part by the Air Force Cambridge 
Research Laboratories, Office of Aerospace 
Research, and the Air Force Office of Scientific 
Research, Office of Aerospace Research. 



Progress 

Computer algorithmic languages are formal 
languages; that is, they consist of a formal 
syntax for deriving the meaningful units of 
expression such as words, clauses, sentences, 
arithmetic expressions, etc. Several grammatical 
systems for deriving the syntax of formal languages 
are in the literature. These give rise to various 
families of formal languages such as the recursively 
enumerable sets, context sensitive languages, 
context free languages, and regular sets. Formal 
languages are also defined by special kinds of 
acceptors such as the finite-state acceptors and 
the pushdown acceptors. Both grammatical and 
acceptor-based methods of defining formal 
algorithmic languages have been investigated as 
have the languages obtained by these methods. 

Of all the models used to consider programming 
languages, the most universally accepted one is 
that of the context free language (i.e. , a language 
defined by Backus normal form) . Four of the five 
technical reports written during 1966 concern 
this model. In [).]! context free grammars are 
considered in which indexed brackets are inserted 
around the right-hand side of the rules in the 
grammar. The resulting language, called "bracketed," 
appears to be a natural component in the theory of 
transformational grammars, a topic of concern in 
natural languages. In the report, an algebraic 
condition is given for one bracketed language to 
be a subset of another. It is also shown that 
the intersection and difference of two bracketed 
languages with the same brackets and terminals is 
a context free language. Report [2] concerns a 
special family of context free languages that 
arose from mathematical considerations. In it, 
two characterizations of bounded regular sets 
are given. In addition, certain connections with 
items of mathematical interest are noted. In [4] , 
partial algorithms for context free grammars are 
considered. The partial algorithms considered 
here are of the following form: "Suppose a certain 
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